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DETAILED ACTION 

Continued Examination Under 37 CFR 1.114 

A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1 .1 7(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1 .17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 
05/10/2010 has been entered. 

Response to Arguments 

1 . Applicant's arguments filed 05/1 0/201 0 have been fully considered but they are 
not persuasive. 

2. As to Applicant argument that the examiner ignored applicants' technical analysis 
and reasoning throughout the prosecution of this application and is requested to 
address each of applicants' arguments presented in the Request for reconsideration 
filed on April 9, 2010 by providing analysis and explanation regarding any alleged 
deficiencies in the technical reasoning supporting these patentability arguments, and 
not by merely reiterating large portions of the cited reference(s) and simply concluding 
that those portions teach the claimed features. 

3. The Examiner respectfully disagrees and it is unclear to which arguments 
applicant asserts that that examiner did not address. 

4. As to Applicants argument that Chakraborty does not disclose or suggest the 
claimed distinction between a shot and a scene. 
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5. The Examine respectfully disagrees. Chakraborty discloses where a "shot" or 
"take" in video parlance refers to a contiguous recording of one or more video frames 
depicting a continuous action in time and space. Typically, transitions between shots 
(also reference to as "scene changes" or "cuts") are created intentionally by film 
directors, col. 1 line 35-40. A shot is one or more frames and since the transition 
between shots is considered a scene change, clearly the sequence of the one or more 
shots before a transition is suggested to be a scene. 

6. As to Applicants argument that the various metrics taught by Chakraborty cannot 
correspond to the "claimed calculator for calculating shot density DS of the video from 
the respective shots" and "calculator for calculating motion intensity of the respective 
shots" and dynamic/static scene classifier for classifying the respective shots into a 
dynamic scene with much motion or a static scene with little motions based on the shot 
density and the motion intensity" because no shot has yet been determined at the time 
the various metrics are used to analyze the video. 

The examiner respectfully disagrees. It is the combination of the Chakraborty and 
Toklu that teaches the applicants claimed invention. Chakraborty teaches a video 
processing system and method employing an integrated processor for segmenting a 
video into shots by successfully identifying scene changes, both abrupt and gradual, 
see col. 4 line 65 to col. 5 line 7. Further, Chakraborty discloses where the output of 
each of the scene change detection processes are potential shots/scene change 
location based on the respective metrics (steps 21 1 ,212, and 213), both abrupt and 
gradual. The next steps in the preferred scene change detection process involve 
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identifying and validating the scene changes based on various conditions. For instance, 
referring to FIG. 2b, in the preferred embodiment, abrupt scene changes are identified 
where candidate scene change location output from the shot detection processes of the 
interframe and histogram difference metrics are in agreement (step 214). In particular, 
abrupt scene changes are identified by verifying that the conditions regarding both the 
interframe difference metric and the difference are satisfied. It is to be appreciated that 
by integrally utilizing the scene change candidates output from such shot detection 
processes, false alarms in identifying scene changes that may occur due to small 
motion where the interframe difference is high (and thus exceed the threshold in 
equation 1 1 above) will not occur since the condition for the histogram difference for the 
candidate must also be satisfied. (In which case, for small motion, such condition 
typically will not be satisfied), Column 12 line 60 to column 13 line 14 and fig. 2A-2B. 
Further regarding fig. 2A, element 211, outputs potential shot/scene change locations 
based on interframe difference. Since fig. 2A, element 21 1 , outputs potential shot/scene 
change locations based on interframe difference, it is clear to the Examiner that 
Chakraborty discloses to calculate the motion of the shot, which reads upon the claimed 
limitation. Since Chakraborty discloses where the output of each of the scene change 
detection processes are potential shots/scene change location based on the respective 
metrics (steps 211,212, and 213), both abrupt and gradual. The next steps in the 
preferred scene change detection process involve identifying and validating the scene 
changes based on various conditions. For instance, referring to FIG. 2b, in the preferred 
embodiment, abrupt scene changes are identified where candidate scene change 
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location output from the shot detection processes of the interframe and histogram 
difference metrics are in agreement (step 214). In particular, abrupt scene changes are 
identified by verifying that the conditions regarding both the interframe difference metric 
and the difference are satisfied. It is to be appreciated that by integrally utilizing the 
scene change candidates output from such shot detection processes, false alarms in 
identifying scene changes that may occur due to small motion where the interframe 
difference is high (and thus exceed the threshold in equation 1 1 above) will not occur 
since the condition for the histogram difference for the candidate must also be satisfied. 
(In which case, for small motion, such condition typically will not be satisfied), Column 
12 line 60 to column 13 line 14 and fig. 2A-2B. Further regarding fig. 2A, element 211, 
outputs potential shot/scene change locations based on interframe difference, it is clear 
to the Examiner that Chakraborty discloses to determine if the scene is gradual or static, 
which reads upon the claimed limitation. Toklu discloses that conventional video 
summarization methods typically include segmenting a video into an appropriate set of 
segments such as video "shots" and selecting one or more key-frames from the shots, 
col. 1 line 33-37. Further, a video segmentation module 12 partitions a video file (that is 
either retrieved from the data base 1 1 or input in real— time as a video data stream) into 
a plurality of video segments (or video segments) and then outputs segment boundary 
data corresponding to the input video data. 

7. Therefore, since Chakraborty teaches to calculate the motion of a shot, where 
the scene change detection processes are potential shots/scene change location based 
on the respective metrics, both abrupt and gradual and the histogram difference metrics 
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which reads upon the density and Toklu discloses a segmentation module that partitions 
a video file into a plurality of video segments such as shots, it is clear to the examiner 
that Chakraborty modified by Toklu discloses to perform video segmentation for video 
data into shots and calculating the histogram difference metric, whether the shot is 
gradual or abrupt, which reads upon the claimed limitation. In response to applicant's 
arguments against the references individually, one cannot show nonobviousness by 
attacking references individually where the rejections are based on combinations of 
references. See In re Keller, 642 F.2d 413, 208 USPQ 871 (CCPA 1981 ); In re Merck & 
Co., 800 F.2d 1091, 231 USPQ 375 (Fed. Cir. 1986). 

8. As to Applicants argument that Chakraborty does not disclose or suggest the 
claimed distinction between a shot and a scene. 

9. The Examiner respectfully disagrees. Chakraborty discloses where a "shot' or 
"take" in video parlance refers to a contiguous recording of one or more video frames. 

1 0. As to Applicants argument regarding that none of the cited references disclose or 
suggest any of the features recited in claims 1, 4, 9, 13 and 14, other than the claimed 
"a shot segmentation device to segment the video into respective shots," because none 
of the references teach or suggest performing the claimed operations on the segmented 
shots or classifying a scene including a plurality of continuous shots. 

1 1 . The Examiner respectfully disagrees. It is the combination of the Chakraborty 
and Toklu as a whole that teaches the claimed limitations. Further, Chakraborty 
(modified by Toklu) as a whole disclose to perform the limitations of the claims on 
segmented shots and classifying a scene including a plurality of continuous shots. Toklu 
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discloses where conventional video summarization method typically include segmenting 
a video into an appropriate set of segments such as video "shots", Col. 1 line Further, 
taught is a video segmentation module 12 partitions a video file (that is either retrieved 
from the database 1 1 or input in real-time as a video data stream) into a plurality of 
video segments (or video segments) and then outputs segment boundary data 
corresponding to the input video data. It is to be understood that that any conventional 
process may be employed herein for partitioning video data which is suitable for 
implementation with the present invention. The above-incorporated cut detection 
method partitions video data into a set of "shots" comprising visually abrupt cuts or 
camera breaks. As stated above, a video "shot" represents a contiguous recording of 
one or more video frames depicting a continuous action in time and space, col. 5 line 
39-60. Since Toklu discloses to the method partitions video data into a set of "shot" 
comprising visually abrupt cuts or camera break, it is clear to the Examiner that Toklu 
segments or classifies continuous shots as either abrupt or as a camera break, which 
reads upon performing operations segmented shots and classifying continuous shots. 
Therefore, Chakraborty (modified by Toklu) discloses to perform segmentation of video 
into shots, classify the shots as well as classify the shots before performing the 
operations of Chakraborty. In response to applicant's arguments against the references 
individually, one cannot show nonobviousness by attacking references individually 
where the rejections are based on combinations of references. See In re Keller, 642 
F.2d 413, 208 USPQ 871 (CCPA 1 981 ); In re Merck & Co., 800 F.2d 1 091 , 231 
USPQ 375 (Fed.Cir. 1986). 
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12. As to Applicants argument that Gonsalves does not disclose or suggest anything 
about an "inserting means [that] makes a type of the video transition effect to be 
inserted different according to whether the highlight scenes to be combined are the 
dynamic scene or the static scene". 

1 3. The Examiner respectfully disagrees. It is the combination of Nakamura 
(modified by Pan and Gonsalves) that teaches applicants limitation. In this case, 
Gonsalves teaches allowing the video editor to insert a video transition effect on a 
field/frame by field/frame basis in order to improve accuracy of the effect, see col. 3 line 
11-14, and 24, between two frames, col. 4 line 65-67, col. 5 line 50-52 and fig. 3b:320A- 
320b). Taking the teachings of Nakamura (modified by Pan) where Pan discloses where 
a special effect, or edit effect at block 16, is almost always present between the normal 
shots in block 12 ands 16 and the slow motion replay segment in block 18. After the 
slow motion replay in block 18, another edit effect in block 20, is usually present before 
resuming normal play, [0020]. The edit effects in 20 and edit effects out 30, mark the 
starting and end points of the procedure of the slow motion replay 14, and typically are 
gradual transitions, such as fade in/out, cross/additive-dissolve, and wipes, [0030] with 
the teachings of Gonsalves where it is disclosed to implement special effects on a 
field/frame by field/frame basis, it is clear to the Examiner that the combination is fully 
capable and suggest to insert special effect on a frame by frame or field by field basis, 
where inserted edit effects for slow motion replay (static highlight scene with little 
motion) are gradual. 
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14. Since there is following the action shot a normal shot, and almost always there is 
an edit effect between the normal shot and slow motion replay segment, and edit effects 
in 20 and edit effects out 30, mark the starting and end points of the procedure of the 
slow motion replay 14, and typically are gradual transitions, such as fade in/out, 
cross/additive-dissolve, and wipes, it is clear to the Examiner that for the slow motion 
replay (static highlight scene), the effects in and out are gradual, which reads upon the 
claimed limitation. 

1 5. As to Applicants argument that the edit effect of Pan is not inserted into a 
combined portion of highlight scene and Pan is silent as to what parameters insertion of 
the edit effect is based. That is, Pan does not disclose an inserting means that includes 
"a dynamic/static scene detector to detect whether a highlight scene is a dynamic scene 
with much motion or a static scene with little motion". 

16. The Examiner respectfully disagrees. Pan discloses where a special effect, or 
edit effect at block 16, is almost always present between the normal shots in block 12 
ands 16 and the slow motion replay segment in block 18. After the slow motion replay in 
block 18, another edit effect in block 20, is usually present before resuming normal play, 
[0020]. The edit effects in 20 and edit effects out 30, mark the starting and end points of 
the procedure of the slow motion replay 14, and typically are gradual transitions, such 
as fade in/out, cross/additive-dissolve, and wipes, [0030]. Since there is following the 
action shot a normal shot, and almost always there is an edit effect between the normal 
shot and slow motion replay segment, and edit effects in 20 and edit effects out 30, 
mark the starting and end points of the procedure of the slow motion replay 14, and 
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typically are gradual transitions, such as fade in/out, cross/additive-dissolve, and wipes, 
it is clear to the Examiner that for the slow motion replay (static highlight scene), the 
effects in and out are gradual, which reads upon the claimed limitation. 

Claim Rejections - 35 USC §112 

1 7. The following is a quotation of the second paragraph of 35 U.S.C. 1 12: 

The specification shall conclude with one or more claims particularly pointing out and distinctly 
claiming the subject matter which the applicant regards as his invention. 

18. Claim 1 recites the limitation "the image" in claim 1, line 13. There is insufficient 
antecedent basis for this limitation in the claim. 



Claim Rejections - 35 USC § 103 

1 9. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

20. The factual inquiries set forth in Graham v. John Deere Co., 383 U.S. 1 , 148 
USPQ 459 (1966), that are applied for establishing a background for determining 
obviousness under 35 U.S.C. 103(a) are summarized as follows: 

1 . Determining the scope and contents of the prior art. 

2. Ascertaining the differences between the prior art and the claims at issue. 

3. Resolving the level of ordinary skill in the pertinent art. 

4. Considering objective evidence present in the application indicating 
obviousness or nonobviousness. 
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21. Claims 1-3, 15, and 17-20 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Chakraborty et al., US-7, 110,454 in view of Toklu et al., US- 
6,549,643 and further in view of Park et al., US-6,597,738. 

Re claim 1, Chakraborty discloses a scene classification apparatus of video for 
classifying a sequence of shots into a dynamic scene with much motion or a static 
scene with little motion, where the dynamic scene and the static scene respectively 
include a plurality of continuous shots and are thus a larger unit than a shot, comprising: 
a calculator for calculating shot density (histogram difference metric, a histogram is a 
graphical display of tabulated frequencies and fig. 2A: 203) DS of the video per a time 
unit(fig. 3C) the from the respective shots (extracted video frames, fig. 2); a calculator 
for calculating motion intensity (histogram difference metric, a histogram is a graphical 
display of tabulated frequencies and fig. 2A:203. Further regarding claim 2A, the 
element 212 output the potential shot/scene change location based on histogram 
difference, therefore, it is clear to the Examiner that Chakraborty teaches to disclose the 
density, which reads upon the claimed limitation) ; a calculator for calculating motion 
intensity of the respective shot (fig. 2A, element 211, outputs potential shot/scene 
change locations based on interframe difference. Since fig. 2A, element 21 1 , outputs 
potential shot/scene change locations based on interframe difference, it is clear to the 
Examiner that Chakraborty discloses to calculate the motion of the shot, which reads 
upon the claimed limitation) of the respective shots (extracted video frames, fig. 2); and 
a dynamic/static scene classifier (metric computation col. 5 line 9-11, fig. 1:14-17 and 
fig. 2A) for classifying the sequence (continuous units or "shots" col. 1 line 35-37) of 
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shots into the dynamic scene (abrupt scene, see abstract, furthermore, the meaning of 
abrupt is interpreted as sudden or fast) with much motion or the static scene with little 
motion (gradual scene, see abstract, furthermore, the meaning of gradual is interpreted 
as slow and not moving quickly) based on the shot density (histogram difference, a 
histogram is a graphical display of tabulated frequencies) and the motion intensity of the 
respective shots (Chakraborty discloses where the output of each of the scene change 
detection processes are potential shots/scene change location based on the respective 
metrics (steps 21 1,212, and 213), both abrupt and gradual. The next steps in the 
preferred scene change detection process involve identifying and validating the scene 
changes based on various conditions. For instance, referring to FIG. 2b, in the preferred 
embodiment, abrupt scene changes are identified where candidate scene change 
location output from the shot detection processes of the interframe and histogram 
difference metrics are in agreement (step 214). In particular, abrupt scene changes are 
identified by verifying that the conditions regarding both the interframe difference metric 
and the difference are satisfied. It is to be appreciated that by integrally utilizing the 
scene change candidates output from such shot detection processes, false alarms in 
identifying scene changes that may occur due to small motion where the interframe 
difference is high (and thus exceed the threshold in equation 1 1 above) will not occur 
since the condition for the histogram difference for the candidate must also be satisfied, 
(in which case, for small motion, such condition typically will not be satisfied), Column 
12 line 60 to column 13 line 14 and fig. 2A-2B. Further regarding fig. 2A, element 211, 
outputs potential shot/scene change locations based on interframe difference, it is clear 
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to the Examiner that Chakraborty discloses to determine if the scene is gradual or static, 
which reads upon the claimed limitation.). 

Chakraborty does not explicitly teach segmented shots; shot segmentation 
device to segment the video into respective shots. However, Toklu teaches a shot 
segmentation device to segmentation device to segment the video into respective shots 
(video segmentation module 12, column 5 line 38-57, and fig 1 element 12). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to incorporate the teachings of Toklu with Chakraborty to generate a 
content based visual summary of video and facilitate digital video browsing and 
indexing, column 3 line 40-43). 

Chakraborty (modified by Toklu) is silent in regards to calculating motion intensity 
per unit region on the image using a value of motion vectors . 

However, Park teaches calculating motion intensity per unit region on the image 
using a value of motion vectors (Park discloses where the motion vector values MVoz, 
MVoy, which are generated in the motion search value divergence processing part 2, 
are input (S14), the motion intensity computing part 13 obtains the motion intensity Lmv 
by using the formula (2) wherein, p is the quantization factor of intensity. If the 
computed motion intensity Lmv is smaller than a threshold value THL (S34), the motion 
vector comparison/conversion part 23 regards that there is a motion in the image which 
is a too small intensity to visualize or random noise occurs in the image during image 
obtaining or processing, so that the motion vector comparison/conversion part 23 
converts all estimation motion values MVfx and MVfy into 0 (S44). Oh the other hand, if 



Application/Control Number: 10/670,245 Page 14 

Art Unit: 2621 

the computed motion intensity Lmv is larger than the threshold value THL, the motion 
vector comparison/conversion part 23 converts the motion vector values Mvx, Mvy into 
all estimated motion values MVfx and MVfy (S54), col. 16 line 20-49 and fig. 14). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to incorporate the teachings of Park with Chakraborty (modified by 
Toklu) for to increase the speed and efficiency of data search, it has been researched 
and developed new search techniques which include the widely-known character-based 
search technique and have composite information attribute, thereby being suitable for 
efficient data search of multimedia (col. 1 line 30-36). 

Regarding claim 2, Chakraborty (modified by Toklu and Park) as a whole 
teaches everything as claimed above, see claim 1. In addition Chakraborty discloses 
the scene classification apparatus of video according to claim 1, wherein the 
dynamic/static (metric computation col. 5 lines 9-11, fig. 1:14-17 and fig. 2A) scene 
classifier classifies a sequence of shots whose shot density (histogram difference, a 
histogram is a graphical display of tabulated frequencies) is larger than first reference 
density and whose motion intensity is stronger than first reference intensity (frame to 
frame intensity col. 1 lines 50-53) into the dynamic (abrupt col. 12, line 67; col. 13 line 1- 
3) scene. 

Regarding claim 3, Chakraborty (modified by Toklu and Park) as a whole 
teaches everything as claimed above, see claim 1. In addition Chakraborty discloses 
the scene classification apparatus of video according to claim 1, wherein the 
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dynamic/static scene detector (metric computation col. 5 lines 9-1 1 , fig. 1 :1 4-1 7 and fig. 
2A) classifies a shot whose shot density (histogram difference, a histogram is a 
graphical display of tabulated frequencies) is smaller than second reference density and 
whose motion intensity (histogram difference computation fig. 1 :16) is weaker than 
second reference intensity into the dynamic scene (gradual scene). 

(2) Regarding claim 15, Chakraborty (modified by Toklu and Park) as a whole 
teaches everything as claimed above, see claims 1 or 4. In addition, Chakraborty 
discloses the scene classification apparatus of video according to claim 1 or 4, wherein 
the video are compressed data (video source may be either compressed or 
decompressed video data, col. 6 lines 45-46). However, Chakraborty silent in regards 
to the motion intensity is detected by using a value of a motion vector of a predictive 
coding image existing in each shot. 

However, Park teaches motion intensity is detected by using a value of a motion 
vector of a predictive coding image existing in each shot (column 16 line 20-35 and fig. 
14). 

Therefore, it would have been obvious for one of ordinary skill in the art at the 
time of the invention to combine the teachings of Chakraborty (modified by Toklu) with 
Parks' teaching of motion intensity detected by motion vectors to increase the speed 
and efficiency of data search, it has been researched and developed new search 
techniques which include the widely-known character-based search technique and have 
composite information attribute, thereby being suitable for efficient data search of 
multimedia (column 1 line 30-36). 
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(2) Regarding claim 15, Chakraborty (modified by Toklu) as a whole teaches 
everything as claimed above, see claims 1 or 4. In addition, Chakraborty discloses the 
scene classification apparatus of video according to claim 1 or 4, wherein the video are 
compressed data (video source may be either compressed or decompressed video 
data, col. 6 lines 45-46). However, Chakraborty silent in regards to the motion intensity 
is detected by using a value of a motion vector of a predictive coding image existing in 
each shot. 

However, Park teaches motion intensity is detected by using a value of a motion 
vector of a predictive coding image existing in each shot (column 16 line 20-35 and fig. 
14). 

Therefore, it would have been obvious for one of ordinary skill in the art at the 
time of the invention to combine the teachings of Chakraborty (modified by Toklu) with 
Parks' teaching of motion intensity detected by motion vectors to increase the speed 
and efficiency of data search, it has been researched and developed new search 
techniques which include the widely-known character-based search technique and have 
composite information attribute, thereby being suitable for efficient data search of 
multimedia (column 1 line 30-36). 

Regarding claim 17 Chakraborty (modified by Toklu and Park) as a whole 
teaches everything as claimed above, see claim9. In addition, Chakraborty discloses 
the scene classification apparatus of video according to claim 9, wherein the video are 
compressed data video source may be either compressed or decompressed video data, 
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col. 6 lines 45-46). Chakraborty is silent in regards to the histogram of motion direction 
is detected by using a value of a motion vector of a predictive coding image existing in 
each shot. 

However, Park teaches the histogram of motion direction is detected by 
using a value of a motion vector of a predictive coding image existing in each shot 
(Park, column 16 line 63 to column 17 line 10, column 22 line 31-49, column 18 line 29- 
31, fig. 9 and fig. 14). 

Therefore, it would have been obvious for one of ordinary skill in the art at the 
time of the invention to combine the teachings of Chakraborty (modified) with Parks' 
teaching of a histogram of motion direction is detected by using a value of a motion 
vector of a predictive coding image existing in each shot to increase the speed and 
efficiency of data search, it has been researched and developed new search techniques 
which include the widely-known character-based search technique and have composite 
information attribute, thereby being suitable for efficient data search of multimedia 
(column 1 line 30-36). 

Regarding claim 18, Chakraborty (modified by Toklu and Park) as a whole 
teaches everything as claimed above, see claim 1or 4. In addition, Chakraborty 
discloses the scene classification apparatus of video according to claims 1 or 4, wherein 
the video are uncompressed data (video source may be either compressed or 
decompressed video data, col. 6 lines 45-46). However, Chakraborty is silent in 
regards to the motion intensity (is detected by using a value of a motion vector 
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representing a change in motion predicted from a compared result of frames composing 
the respective shots. 

However, Park teaches the motion intensity is detected by using a value of a 
motion vector representing a change in motion predicted from a compared result of 
frames composing the respective shots (Park, column 1 1 , line 66 to column 1 2 line 7 
and column 24 line 55-60, and column 18 line 29-31). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to combine the teachings of Chakraborty (modified by Toklu) with 
the Parks' teaching of the motion intensity (is detected by using a value of a motion 
vector representing a change in motion predicted from a compared result of frames 
composing the respective shots, to increase the speed and efficiency of data search, it 
has been researched and developed new search techniques which include the widely- 
known character-based search technique and have composite information attribute, 
thereby being suitable for efficient data search of multimedia (column 1 line 30-36). 

(2) Regarding claim 18, Chakraborty (modified by Toklu) as a whole teaches 
everything as claimed above, see claims 1 or 4. In addition, Chakraborty discloses the 
scene classification apparatus of video according to claims 1 or 4, wherein the video are 
uncompressed data (video source may be either compressed or decompressed video 
data, col. 6 lines 45-46). However, Chakraborty is silent in regards to the motion 
intensity (is detected by using a value of a motion vector representing a change in 
motion predicted from a compared result of frames composing the respective shots. 
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However, Park teaches the motion intensity is detected by using a value of a 
motion vector representing a change in motion predicted from a compared result of 
frames composing the respective shots (Park, column 1 1 , line 66 to column 1 2 line 7 
and column 24 line 55-60, and column 18 line 29-31). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to combine the teachings of Chakraborty (modified by Toklu) with 
the Parks' teaching of the motion intensity (is detected by using a value of a motion 
vector representing a change in motion predicted from a compared result of frames 
composing the respective shots, to increase the speed and efficiency of data search, it 
has been researched and developed new search techniques which include the widely- 
known character-based search technique and have composite information attribute, 
thereby being suitable for efficient data search of multimedia (column 1 line 30-36). 

Regarding claim 19, Chakraborty (modified by Toklu and Park) as a whole 
teaches everything as claimed above, see claims 1 or 4. In addition, Chakraborty 
discloses the scene classification apparatus of video according to claims 1 or 4, wherein 
the video are uncompressed data (video source may be either compressed or 
decompressed video data, col. 6 line 45-46). However, Chakraborty is silent in regards 
to the spatial distribution of motion is detected by using a value of a motion vector 
representing a change in motion predicted from a compared result of frames composing 
the respective shots. 

However, Park teaches the spatial distribution of motion is detected by using a 
value of a motion vector representing a change in motion predicted from a compared 
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result of frames composing the respective shots (Park, column 23 line 20-30. Further 
Park discloses the motion direction is computed from the motion vector values, column 
16 line 62-65 and column 18 line 29-31). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to combine the teachings of Chakraborty (modified by Toklu) with 
Parks' teaching of spatial distribution of motion is detected by using a value of a motion 
vector representing a change in motion, to increase the speed and efficiency of data 
search, it has been researched and developed new search techniques which include 
the widely-known character-based search technique and have composite information 
attribute, thereby being suitable for efficient data search of multimedia (column 1 line 
30-36). 

(2) Regarding claim 19, Chakraborty (modified by Toklu) as a whole teaches 
everything as claimed above, see claims 1 or 4. In addition, Chakraborty discloses the 
scene classification apparatus of video according to claims 1 or 4, wherein the video are 
uncompressed data (video source may be either compressed or decompressed video 
data, col. 6 line 45-46). However, Chakraborty is silent in regards to the spatial 
distribution of motion is detected by using a value of a motion vector representing a 
change in motion predicted from a compared result of frames composing the respective 
shots. 

However, Park teaches the spatial distribution of motion is detected by using a 
value of a motion vector representing a change in motion predicted from a compared 
result of frames composing the respective shots (Park, column 23 line 20-30. Further 
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Park discloses the motion direction is computed from the motion vector values, column 
16 line 62-65 and column 18 line 29-31). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to combine the teachings of Chakraborty (modified by Toklu) with 
Parks' teaching of spatial distribution of motion is detected by using a value of a motion 
vector representing a change in motion, to increase the speed and efficiency of data 
search, it has been researched and developed new search techniques which include 
the widely-known character-based search technique and have composite information 
attribute, thereby being suitable for efficient data search of multimedia (column 1 line 
30-36). 

Regarding claim 20 Chakraborty (modified by Toklu and Park) as a whole 
teaches everything as claimed above, see claims 1or 4. In addition Chakraborty 
discloses the scene classification apparatus of video according to claims 1 and 4, 
wherein the video are uncompressed data (video source may be either compressed or 
decompressed video data, col. 6 line 45-46). Chakraborty is silent in regards to the 
histogram of motion direction (histogram difference metric) is detected by using a value 
of a motion vector representing a change in motion predicted from a compared result of 
frames composing the respective shots. 

However, Park teaches the histogram of motion direction is detected by using a 
value of a motion vector representing a change in motion predicted from a compared 
result of frames composing the respective shots (column 1 1 line 14-27, column 18 line 
29-31 and fig. 1J). 
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Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to combine the teachings of Chakraborty (modified by Toklu) with 
Parks' teaching of spatial distribution of motion is detected by using a value of a motion 
vector representing a change in motion, to increase the speed and efficiency of data 
search, it has been researched and developed new search techniques which include 
the widely-known character-based search technique and have composite information 
attribute, thereby being suitable for efficient data search of multimedia (column 1 line 
30-36). 

(2) Regarding claim 20 Chakraborty (modified by Toklu) as a whole teaches 
everything as claimed above, see claims 1or 4. In addition Chakraborty discloses the 
scene classification apparatus of video according to claims 1 and 4, wherein the video 
are uncompressed data (video source may be either compressed or decompressed 
video data, col. 6 line 45-46). Chakraborty is silent in regards to the histogram of motion 
direction (histogram difference metric) is detected by using a value of a motion vector 
representing a change in motion predicted from a compared result of frames composing 
the respective shots. 

However, Park teaches the histogram of motion direction is detected by using a 
value of a motion vector representing a change in motion predicted from a compared 
result of frames composing the respective shots (column 1 1 line 14-27, column 18 line 
29-31 and fig. 1J). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to combine the teachings of Chakraborty (modified by Toklu) with 
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Parks' teaching of spatial distribution of motion is detected by using a value of a motion 
vector representing a change in motion, to increase the speed and efficiency of data 
search, it has been researched and developed new search techniques which include 
the widely-known character-based search technique and have composite information 
attribute, thereby being suitable for efficient data search of multimedia (column 1 line 
30-36). 

22. Claims 4-6,9-14, and 16 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Chakraborty etal., US-7, 110,454 in view of Toklu et al., US- 
6,549,643. 

Regarding claim 4, Chakraborty discloses a scene classification apparatus (fig. 
1 ) of video for segmenting video into shots (col. 5, line 1 ) for classifying a sequence of 
shots into a slow scene, where the slow scene includes a plurality of continuous shot 
and is thus a larger unit than a shot, comprising: an extractor for extracting from the 
respective shots a shot_ (validation module col. 7 lines 54-55) similar to a current target 
shot (candidate and non-candidate scene change locations (frames) col. 7 lines 36-38 
and fig. 1:19) from shots after a shot before the target shot (compares neighboring 
keyframes col. 7 line 55) only by a predetermined interval (predetermined threshold col. 
14 line 59); and a slow (gradual) scene detector (interframe variance difference col. 7 
line 48-50) for classifying the target shot (candidate and non-candidate scene change 
locations (frames) col. 7 lines 39-38) into a slow scene (gradual) of the shot similar to 
the current target shot based on motion intensity (Chakraborty discloses where the 
output of each of the scene change detection processes are potential shots/scene 
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change location based on the respective metrics (steps 21 1 ,212, and 213), both abrupt 
and gradual. The next steps in the preferred scene change detection process involve 
identifying and validating the scene changes based on various conditions. For instance, 
referring to FIG. 2b, in the preferred embodiment, abrupt scene changes are identified 
where candidate scene change location output from the shot detection processes of the 
interframe and histogram difference metrics are in agreement (step 214). In particular, 
abrupt scene changes are identified by verifying that the conditions regarding both the 
interframe difference metric and the difference are satisfied. It is to be appreciated that 
by integrally utilizing the scene change candidates output from such shot detection 
processes, false alarms in identifying scene changes that may occur due to small 
motion where the interframe difference is high (and thus exceed the threshold in 
equation 1 1 above) will not occur since the condition for the histogram difference for the 
candidate must also be satisfied, (in which case, for small motion, such condition 
typically will not be satisfied), Column 12 line 60 to column 13 line 14 and fig. 2A-2B. 
Further regarding fig. 2A, element 21 1 , outputs potential shot/scene change locations 
based on interframe difference, it is clear to the Examiner that Chakraborty discloses to 
determine if the scene is gradual or static, which reads upon the claimed limitation) of 
the current target shot (candidate and non-candidate scene change locations (frames) 
col. 7 lines 36-38 and fig. 1:19) and the shot similar to the current target shot (key frame 
col. 14 lines 52-57 and fig 2B: 229). 

Chakraborty does not explicitly teach a shot segmentation device to segment the 
video into respective shots. However, Toklu teaches a shot segmentation device to 



Application/Control Number: 10/670,245 Page 25 

Art Unit: 2621 

segmentation device to segment the video into respective shots (video segmentation 
module 12, column 5 line 38-57, and fig 1 element 12). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to incorporate the teachings of Toklu with Chakraborty to generate a 
content based visual summary of video and facilitate digital video browsing and 
indexing, column 3 line 40-43). 

Regarding claim 5, Chakraborty (modified by Toklu) as a whole teaches 
everything as claimed above, see claim 1. In addition Chakraborty discloses the scene 
classification apparatus of video according to claim 4, wherein the slow (gradual) scene 
detector (interframe variance difference metric computation col. 7 line 48-50 and fig. 1 : 
17) classifies the target shot (candidate and non-candidate scene change locations 
(frames) col. 7 lines 36-38 and fig. 1 :19) into the slow scene (gradual scene) of the shot 
similar to the current target shot (candidate and non-candidate scene changes locations 
(frames), col. 7 line 36-38 and fig. 1 :19) when the motion intensity (interframe difference 
col. 14 lines 30-32) of the shot similar to the current target shot (col. 7 line 36-38 and fig. 
1 :19) is stronger than the motion intensity (interframe difference col. 14 lines 30-32) of 
the current target shot (candidate and non-candidate scene change locations (frames) 
col. 5 line 20-24). 

Regarding claim 6, Chakraborty (modified by Toklu) as a whole teaches 
everything as claimed above, see claim 1. In addition Chakraborty further discloses 
comprising a first highlight (gradual) scene detector (shot list database col. 8 line 8-1 1 
fig. 1 :21 ) for classifying a scene composed of a plurality of shots continued just before 
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(neighboring key frames col. 7 line 55-59) the slow (gradual) scene into a first highlight 
(gradual) scene. 

Regarding claim 9, Chakraborty discloses a scene classification apparatus (fig. 
1 ) of video for segmenting video into shots (col. 5, line 1 ) and classifying a sequence of 
shots into a scene in which a camera operation has been performed, where the scene 
in which the camera operation has performed includes a plurality of continuous shots 
and is thus a larger unit that a shot, comprising: a detector for detecting a histogram 
relating to motion directions of the respective shots (histogram difference metric col. 8 
line 51-56 and col. 9 line 4-5); and a detector for detecting the scene in which the 
camera operation has been performed based on the histogram of motion direction 
(Chakraborty teaches where camera moves: these shots include the classical camera 
movements i.e. zoom, tilt, pan etc. In a preferred embodiment, the histogram difference 
metric given by equation (3) above is also analyzed to detect scene changes (step 206). 
It is also to be understood that other conventional metrics may be used for this metric 
such as the so called X 2 static given by . chi. .times. 

.times.. function.. function.. function.. function. ##EQU00016## It is known, however, that 
while this statistic is more sensitive to interframe difference across a camera break, it 
also enhances the differences arising out of small object or camera motion, Column 1 1 
line 35-50. Therefore, it is clear to the Examiner that Chakraborty discloses where the 
histogram metrics can detect camera motions, which reads upon the claimed limitation). 

Chakraborty does not explicitly teach a shot segmentation device to segment the 
video into respective shots. However, Toklu teaches a shot segmentation device to 
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segmentation device to segment the video into respective shots (video segmentation 
module 12, column 5 line 38-57, and fig 1 element 12). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to incorporate the teachings of Toklu with Chakraborty to generate a 
content based visual summary of video and facilitate digital video browsing and 
indexing, column 3 line 40-43). 

Regarding claim 10, the combination of Chakraborty as a whole teaches 
everything as claimed above, see claim 1. In addition Chakraborty discloses the scene 
classification apparatus of video according to claim 9, further comprising a zooming 
scene detector (interframe variance difference metric col. 4 lines 15-17) for, when the 
histogram of motion direction (histogram difference metric col. 8 lines 54-57) is uniform 
(col. 8 lines 62-63, i.e. "normal' intensity distribution) and a number of elements of 
respective bins is larger than a reference number of elements (each bin corresponding 
to an intensity range col. 8 line 53), classifying its shot into a zooming scene (gradual 
scene). 

Regarding claim 11, Chakraborty (modified by Toklu) as a whole teaches 
everything as claimed above, see claim 1. In addition Chakraborty discloses the scene 
classification apparatus of video according to claim 9, further including: detector for 
detecting spatial distribution (variance difference furthermore, the variance difference 
detects the difference within a frame where spatial distribution takes place) of motion of 
each shot; and a panning scene detector (interframe and histogram difference metric 
col. 7 lines 46-48) for detecting whether the respective shots are a panning scene 
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(abrupt scene) based on the histogram of motion direction (histogram difference metric, 
the histogram as well as the interframe difference metric are processed to validate 
candidate scene changes as abrupt col. 7 lines 45-48 and fig. 2A: 202-203) and the 
spatial distribution of motion (variance difference). 

Regarding claim 12, Chakraborty (modified by Toklu) as a whole teaches 
everything as claimed above, see claim 1. In addition Chakraborty discloses the scene 
classification apparatus of video according to claim 1 1 , wherein when the histogram of 
motion (histogram difference metric) direction is concentrated in one direction and the 
spatial distribution (variance difference furthermore, the variance difference detects the 
difference within a frame where spatial distribution takes place) of motion is uniform 
(typically assumed not to change from frame to frame col. 12 lines 33-34), the panning 
scene detector (interframe and histogram difference metric col. 7 lines 46-48) classifies 
shot into the panning (abrupt) scene. 

Regarding claim 13, Chakraborty discloses a scene classification apparatus of 
video for classifying a sequence of shots into a commercial scene, where the 
commercial scene includes a plurality of continuous shots and is thus larger unit than a 
shot, comprising: a detector for detecting a shot density DS (histogram difference 
metric, a histogram is a graphical display of tabulated frequencies) of the video; and a 
commercial scene detector (Chakraborty discloses video are playing an increasingly 
import role in education and commerce, Column 1 line 15-18. Further, when the 
approximate maximum duration is known, since the frames/sec is always known, the 
maximum frame duration for the scene change is readily ascertainable. If any of the 
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windows have a duration that exceeds this threshold, it may be assumed that the 
window in question is not likely to be a gradual scene change. In such as case, further 
examination becomes necessary. The possibilities are that either the window represents 
just motion or a combination of scene change and motion. In the preferred embodiment, 
if any window has a duration that exceeds the predefined threshold, it is assumed that 
the window represents motion, and consequently all points in such window are turned 
"off' (step 224). All the remaining windows are then identified as candidates for gradual 
scene change, column 14 line 20-35. Chakraborty teaches a predefined shot duration 
(column 13 line 15 to 35); which is equivalent to the shot density. Therefore, since 
Chakraborty discloses videos in education and commerce, and based on the predefined 
window threshold, the scene is either gradual or abrupt, it is clear to the examiner that 
Chakraborty is fully capable of detecting a commercial scene based on the shot density, 
which reads upon the claimed limitation) for detecting the commercial scene (abrupt 
scene) by comparing a shot density (minimum predefined shot duration col. 13 lines 
18-35) detected during a predetermined interval with a predetermined reference shot 
density (column 14 line 21-27). 

23. Claims 14 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Chakraborty et al., US-7, 1 10,454 in view of Toklu et al., US-6,549,643 and in view of 
Yilmaz et al., Shot Detection Using Principal Coordinate System. 

Regarding claim 14, Chakraborty discloses a scene classification apparatus of 
video for a sequence of shots into a commercial scene, where the commercial scene 
includes a plurality of continuous shots and is thus a larger unit than a shot, comprising: 
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a detector for detecting a number of shot boundaries (threshold levels, col. 5 lines 22- 
23, furthermore, histograms are the most common method used to detect shot 
boundaries) of the video; and a commercial scene detector (interframe and histogram 
difference metric, col. 7 lines 46-48) for detecting the commercial scene (abrupt scene 
Chakraborty further discloses video in education and commerce; a video in commerce 
would be a commercial scene) by comparing a number of shot boundaries (threshold 
level col. 5 line 22-23) detected during a predetermined interval with a predetermined 
reference number (column 14 line 21-27), and classifying the scene as the commercial 
scene in response to the comparing indicating that the number of shot boundaries 
detected during the predetermined interval is greater than the predetermined reference 
number. 

Chakraborty does not explicitly teach a shot segmentation device to segment the 
video into respective shots; and classifying the scene as the commercial scene in 
response to the comparing indicating that the number of shot boundaries detected 
during the predetermined interval is greater than the predetermined reference number. 

However, Toklu teaches a shot segmentation device to segmentation device to 
segment the video into respective shots (video segmentation module 12, column 5 line 
38-57, and fig 1 element 12). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to incorporate the teachings of Toklu with Chakraborty to generate a 
content based visual summary of video and facilitate digital video browsing and 
indexing, column 3 line 40-43). 
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Chakraborty modified by Toklu is silent in regards to classifying the scene as the 
commercial scene in response to the comparing indicating that the number of shot 
boundaries detected during the predetermined interval is greater than the 
predetermined reference number. 

However, Yilmaz teaches classifying the scene as the commercial scene in 
response to the comparing indicating that the number of shot boundaries detected 
during the predetermined interval is greater than the predetermined reference number 
(Yilmaz teaches to cluster news video into news and advertisements based, based on 
the shots boundaries detected by principle coordinate approach, we used the minimum 
eigenvalued eigenvector, v3. To define a shot if it is anchor news or advertisement, we 
calculated the mean v3's in a shot and if its below a threshold, it is labeled as anchor 
news; otherwise it is labeled as advertisement, 4.4 Clustering Video Stream into News 
and Commercials. Further disclosed by Yilmaz is that shot boundaries are defined by 
thresholding the rotation changes for the whole video stream, 3.2 Algorithm. Therefore it 
is clear to the Examiner that Yilmaz teaches to determine a commercial based on the 
shot boundary thresholds. Since Chakraborty teaches determining a scene change 
(shot boundary) by comparing each of the computed metrics for the successive frames 
to threshold levels, and Yilmaz teaches to define a shot if it is anchor news or 
advertisement by the calculated v3 and a threshold, Chakraborty now (modified by 
Yilmaz) teaches where a commercial is determined by a shot boundary threshold, which 
reads upon the claimed limitation). 

Thus it would have been obvious to one of ordinary skill in the art at the time of 
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the invention to incorporate the teachings of Yilmaz with Chakraborty (modified by 
Toklu) for improving efficiency of shot detection. 

24. Claims 16 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Chakraborty et al., US-7, 1 10,454 in view of Toklu et al., US-6,549,643 and in view of 
Yilmaz et al., Shot Detection Using Principal Coordinate System. 

Regarding claim 16, Chakraborty (modified by Toklu) as a whole teaches 
everything as claimed above, see claim 11. In addition, Chakraborty teaches wherein 
the video are compressed data (video source may be either compressed or 
decompressed video data, col. 6 lines 45-46), and the spatial distribution (variance 
difference, referring to within the frame, furthermore, MPEG has spatio temporal locator 
capabilities) of motion is detected by using a value of a motion vector of a predictive 
coding image existing in each shot (MPEG, col. 6 lines 51-60, furthermore, MPEG is a 
predictive image coding technique that incorporates tabulating motion vector values). 

10. Claims 7-8 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Chakraborty (US Patent 7,1 1 0,454) in view of Toklu et al., US-6,549,643 and in further 
view of Blanchard US Patent 63471 14). 

Regarding claim 7, Chakraborty fails to teach a detector for detecting the 
intensity of audio signals accompanied by the video. Blanchard teaches a detector for 
detecting intensity of an audio signal (audio levels col. 3 lines 37-51) accompanied by 
the video (col. 2 lines 27-29) into shot. Blanchard also teaches detector for classifying a 
scene composed of a plurality of shots continued before and after a shot with the audio 
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signal intensity stronger than the predetermined intensity (col. 2 lines 17-22) into a 
second highlight scene (gradual scene). 

Taking the combined teaching of Chakraborty (modified by Toklu) and Blanchard 
as a whole, it would have been obvious to one of ordinary skill in the art at the time that 
the invention was made to incorporate detecting the intensity of audio signals 
accompanied by the video as claimed for the benefit of detecting scene changes that 
may generally be identified and distinguished from mere shots changes where the audio 
level will generally remain the same. 

Regarding Claim 8, the combination of Chakraborty (modified by Toklu) and 
Blanchard as whole further teaches everything claimed as applied above; see claims 7. 
In addition Chakraborty teaches a commercial scene detector (interframe and histogram 
difference metric col. 7 lines 46-48, Chakraborty) for classifying the respective shots into 
a commercial scene (abrupt scene), wherein a scene classified into a scene other than 
the first highlight scene (gradual), the second highlight scene (gradual scene) and the 
commercial scene (abrupt scene) is classified into the highlight scene (gradual). 
12. Claim 21 is rejected under 35 U.S.C 103(a) as being unpatentable over 
Nakamura etal., US-2001/0051516 and in view of Pan etal., US-2002/0080162 in view 
of Gonsalves et al., US-6,392,71 0. 

Regarding claim 21, Nakamura teaches a scene classification apparatus 
of video for segmenting video into shots and classifying each scene composed of one or 
more continuous shots based on a content of the scene comprising: a detector for 
detecting a highlight scene (In such a case that a plurality of highlight scenes are 
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detected by the analyzing unit 22, [0208] and fig. 2); extracting and combining means 
for extracting and combining a plurality of highlight scenes (In such a case that a 
plurality of highlight scenes are detected by the analyzing unit 22 from a program during 
a CM broadcasting time range, and the present CM broadcast is commenced, the 
reproducing management unit 27 reproduces a plurality of detected highlight scenes in 
a time sequential manner by equally increasing a reproducing speed, [0208] and fig. 2. 
Nakamura discloses the reproducing management unit 27 reproduces a plurality of 
detected highlight scenes and the highlight scenes are stored in a highlight scene index 
storage unit, (fig. 2, element 25), it is clear to the examiner that in order to reproduce the 
highlight scenes stored in the storage unit, by the reproducing management unit, the 
highlight scenes are retrieved and combined, thus reading upon the claimed limitation). 
Nakamura is silent in regards to inserting means for inserting a video transition effect 
into a combined portion of the respective highlight scenes, the inserting means including 
a dynamic/static scene detector to detect whether a highlight scene is a dynamic scene 
with much motion or a static scene with little motion wherein the inserting means makes 
a type of the video transition effect to be inserted different according to whether the 
highlight scenes to be combined are they dynamic scene or the static scene. 

However, Pan teaches inserting means for inserting a video transition effect into 
a combined portion of the respective highlight scenes, the inserting means including a 
dynamic/static scene detector to detect whether a highlight scene is a dynamic scene 
with much motion or a static scene with little motion (Pan teaches where the pattern of 
a slow motion replay in sports program including very fast movement of objects 
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(persons, ball, and etc.), generally referred to as action shots at block 10. Following the 
action shots at block 10 there may be other shots or video content at block 12 prior to 
the slow motion replay segment in block 14. A special effect, or edit at block 16, is 
almost always present between the normal shots in block 12 and 16 and the slow 
motion replay segment in block 18. After the slow motion replay in block 18, another edit 
effect in block 20, is usually present before resuming normal play. A more detailed 
structure of the slow motion replay 14 of FIG. 1 is shown in FIG. 2. Typically the 
procedure of the slow motion replay includes six components, namely, edit effects in 20, 
still fields 22, slow motion replay 24, normal replay 26, still fields 28, and edit effect out 
30, [0028-0029]. The edit effects in 20 and edit effects out 30, mark the starting and 
end points of the procedure of the slow motion replay 14, and typically are gradual 
transitions, such as fade in/out, cross/additive-dissolve, and wipes. Frequently, the logo 
of the television station will be shown during the edit effects in 20 and edit effects out 
30. Other techniques may likewise be used, col. Therefore, it is clear to the Examiner 
that Pan discloses an inserting means to insert a transition effect into the action shots, 
and determines if the action shot is a slow replay shot or normal speed replay, which 
reads upon the claimed limitation. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to incorporate the teachings of Pan with Nakamura for providing a 
generic system for video analysis which reliably detects semantically significant events 
in a video, [0006]. 
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Nakamura (modified by Pan) is silent in regards to wherein the inserting means 
makes a type of video transition effect to be inserted different according to whether the 
highlight scenes to be combined are the dynamic scene or the static scene. 

However, Gonsalves teaches allowing the video editor to insert a video transition 
effect on a field/frame-by-field/frame basis in order to improve accuracy of the effect 
(Gonsalves, special effect, col. 3 line 11-14 line 24, between two frames col. 4 line 65- 
67, col. 5 lines 50-52, and fig. 3b: 320a-320b). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to incorporate the teachings of Gonsalves with Nakamura 
(modified by Chakraborty) to improve accuracy of the effect. 

Claim 22 is rejected under 35 U.S.C 103(a) as being unpatentable over 
Nakamura et al., US-2001/0051516 and in view of Pan et al., US-6,931,595 in view of 
Gonsalves et al., US-6,392,710 and further in view of Gotoh et al., US-5,801 ,765. 

Regarding claim 22, Nakamura (modified by Pan and Gonsalves) as a whole 
teaches everything as claimed above, see claim 21 . Nakamura is silent in regards to the 
scene classification apparatus of video according to claim 21 , wherein when the 
highlight scene is the dynamic scene, the video transition effect with small change in an 
image mixing ratio is inserted therein, and when the highlight scene is the static scene, 
the video transition effect with large change in the image mixing ratio is inserted therein. 

However, Gotoh discloses where specifically, the scene-change is classified into 
two types depending on how a video changes: one in which a scene changes 
momentarily; and one in which a scene changes gradually. Those generally referred to 
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as the scene-change is the former, i.e., a scene appeared in a moment of pressing a 
record start button (see Fig. 1 1(a)). The latter are those given special effects, such as 
effect and fade, when editing a video (see Fig. 1 1 (b)). Hereinafter, the former and the 
latter are referred to as "momentary scene-change" and "gradual scene-change" 
respectively. In a gradual scene-change, it takes much time that a scene change to 
another. In the Fig. 1 1 (b), pictures H to K comprise a gradual scene-change, column 2 
line 1 7-29 and fig. 11 (a) and 1 1 (b). Therefore, it is clear to the examiner that Gotoh 
discloses a special effect that has a momentary change for a dynamic scene and a 
special effect that takes much time to change for gradual scene, which reads upon the 
claimed limitation. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to incorporate the teachings of Gotoh with Nakamura (modified by 
Pan and Gonsavles) for providing improved image quality. 

Conclusion 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to JESSICA ROBERTS whose telephone number is 
(571)270-1821 . The examiner can normally be reached on 7:30-5:00 EST Monday- 
Friday, Alt Friday off. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Marsha D. Banks-Harold can be reached on (571) 272-7905. The fax 
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phone number for the organization where this application or proceeding is assigned is 
571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/Marsha D. Banks-Harold/ 

Supervisory Patent Examiner, Art Unit 2621 



/Jessica Roberts/ 
Examiner, Art Unit 2621 



